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IMPROVED VACCINES 

The invention relates to vaccines. 

Background of the Invention 
This invention was made in the course of work 
5 supported by the United States Government, which has 
certain rights in the invention. 

Enteric f evers and diarrheal diseases, e.g., 
typhoid fever and cholera, are major causes of morbidity 
- -.and_inqrtal^ the developing world, Hook et 

10 al., 1980, In Harrison's Principles of Internal Medicine, 
9th Ed. , 641-848, McGraw Hill, New York. Traditional 
approaches to the development of vaccines for bacterial 
diseases include the parenteral injection of purified 
components or killed organisms. These parenterally 
15 administered vaccines require technologically advanced 
preparation, are relatively expensive, and are often, 
because of dislike for needle-based injections, resisted^ 
by patients. Live oral vaccine strains have several 
advantages over parenteral vaccines : low cost , ease of 
20 administration, and simple preparation. 

The development of live vaccines has often been 
limited by a lack of understanding of the pathogenesis of 
the disease of interest on a molecular level. Candidate 
live vaccine strains require nonrevertible genetic 
25 alterations that affect the virulence of the organism, 
but not its induction of an immune response. Work 
defining the mechanisms of toxigenesis of vibrio cholerae 
has made it possible to create live vaccine strains based 
on deletion of the toxin genes, Mekalanos et al., 1983, 
30 Nature 306:551, Levine et al., 1988, Infect. Immun. 
56:161. 

Recent studies have begun to define the molecular 
basis of Salmonella typhimurium macrophage survival and 
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virulence, Miller et al. , 1989, Proc. Natl. Acad. Sci. 
USA 865 5054, hereby incorporated by reference. 
Salmonella typhimurium strains with mutations. in the 
positive regulatory regulon phoP are markedly attenuated 
5 in virulence for BALB/c mice. The phoP regulon is 

composed of two genes present in an operon, termed phoP 
and phoQ. The phoP and phoQ gene products are highly 
similar to other members of bacterial two-component 
transcriptional regulators that respond to environmental 
10 stimuli and control the expression of a large number of 
other genes. A mutation at one of these phoP regulatory- 
region regulated genes, page, confers a virulence defect. 
Strains with page, phoP, or phoQ mutations afford partial 
protection to subsequent challenge by wild-type S. 

15 typhimurium. 

Salmonella species cause a spectrum of clinical 
disease that includes enteric fevers and acute 
gastroenteritis, Hook et al., 1980, supra. Infections 
with Salmonella species are more common in 

20 immunosuppressed persons, Celum et al. , 1987, J. Infect. 
Dis. 155:998. S. typhi, the bacterium that causes 
typhoid fever, can only infect man, Hook et al., 1980, 
supra. The narrow host specif icity of S. typhi has 
resulted in the extensive use of S. enteriditis 

25 typhimurium infection of mice as a laboratory model of 
typhoid fever, Carter et al., 1984 J. Exp. Med. 139:1189. 
s. typhimurium infects a wider range of hosts, causing 
acute gastroenteritis in man and a disease similar to 
typhoid fever in the mouse and cow. 

30 salmonella infections are acquired by oral 

ingestion. The organisms, after traversing the stomach, 
replicate in the small bowel, Hornik et al., 1970, N. 
Eng. J. Med. 283:686. Salmonella are capable of invasion 
of the intestinal mucosal cells, and S. typhi can pass 

35 through this mucosal barrier and spread via the Peyer's 
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patches to the lamina propria and regional lymph nodes. 
Colonization of the reticuloendothelial cells of the host 
then occurs after bacteremia. The ability of S. typhi to 
survive and replicate within the cells of the human 
5 reticuloendothelial system is essential to its 

pathogenesis, Hook et al., 1980, supra, Hornick et al., 
1970, supra, and Carter et al., 1984, supra. 

Immunity to Salmonella typhi involves humoral and 
cell-mediated immunity, Murphy et al. , 1987, J. Infect. 

10 _. 03.3.^,56:1005, and is obtainable by vaccination, Edelman 
et al., 1986, Rev. Inf". Dis. 8:324. Recently, human - - 
field trials demonstrated significant protective efficacy 
against S. typhi infection after intramuscular 
vaccination with partially purified Vi antigen, Lanata et 

15 al., 1983, Lancet 2:441. Antibody-dependent enhancement 
of Si typhi killing by T cells has been demonstrated in 
individuals who received a live S. typhi vaccine, 
indicating that these antibodies may be necessary for the 
host to generate a cell-mediated immune response, Levine 

20. et al., 1987, J. Clin. Invest. 79:888. The cell-mediated 
immune response is important in typhoid immunity since 
killed vaccines that do not induce this immune response 
are not protective in man, Collins et al. , 1972, Infect. 
Immun. 41:742. 

25 Summary of the Invention 

In general, the invention features a vaccine, 
preferably a live vaccine, including a bacterial cell, 
preferably a Salmonella cell, e.g., a S. typhi, S. 
enteritidis typhimurium, or S. cholerae-suis cell, the 

30 virulence of which is attenuated by the constitutive 

expression of a gene under the control of a two-component 
regulatory system. In preferred embodiments the 
constitutive expression is the result of a mutation at a 
component of the two-component regulatory system. In 
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preferred embodiments the bacterial cell includes a 
second mutation which attenuates virulence. 

In yet other preferred embodiments of the vaccine 
the two-component regulatory system is the phoP 
5 regulatory region, and the gene under the control of the 
two-component system is a phoP regulatory region 
regulated gene, e.g., a prgr or pag gene, e.g. , pagC. In 
preferred embodiments constitutive expression is the 
result of a change or mutation (preferably a non- 
10- revert ible mutation) at the promoter of the regulated 

gene or of the phoP regulatory Tregibn, e.g. , a mutation - 
in the phoQ or the phoP gene, e.g., the phoP c mutation. 

In preferred embodiments of the vaccine the 
Salmonella cell includes a first mutation which 
15 attenuates virulence, e.g., a mutation in a phoP 

regulatory region gene, e.g., a mutation in the phoP or 
phoQ gene, e.g., phoP c , or a mutation in a phoP 
regulatory region regulated gene, and a second mutation 
which attenuates virulence, e.g. , a mutation in an 
20 aromatic amino acid synthetic gene, e.g., an aro gene, a 
mutation in a phoP regulatory region regulated gene, 
e.g. , a mutation in a prg or pag locus, e.g. , a page 
mutation. 

In yet other preferred embodiments the bacterial 
25 cell includes a first mutation in a phoP regulatory 

region gene and a second mutation in an aromatic amino 
acid synthetic gene, e.g, an aro gene. 

In another aspect, the invention features a 
vaccine, preferably a live vaccine, including a bacterial 
30 cell , the virulence of which is attenuated by a mutation 
in a gene under the control of a two-component regulatory 
system. In preferred embodiments the bacterial cell 
includes a virulence attenuating mutation in a second 
gene, e.g., in an aromatic amino acid synthetic gene, 
35 e.g., an aro gene. 
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In yet other preferred embodiments of the vaccine 
the bacterial cell is Salmonella cell, the two-component 
regulatory system is the phoP regulatory region, and the 
gene under its control is a prg or a pag gene, e.g., the 
5 page gene. 

In another aspect the invention features a 
vaccine, preferably a live vaccine, including a 
Salmonella cell e.g., a S. typhi, S. enteritidis 
typhimurium, or S. cholerae-suis cell, including a first 

10 virulence attenuating mutation in an aromatic amino acid 

" biosynthetic "gene ,~ e . gry an aro gene , -and a second — ^ 

virulence attenuating mutation in a phoP regulatory 
region gene, e.g., a phoP" mutation. 

In another aspect the invention features a 

15 bacterial cell, or a substantially purified preparation 
thereof, preferably a Salmonella cell, e.g., a S. typhi, 
S. enteritidis typhimurium, or S. cholerae-suis cell, 
which constitutively expresses a gene under the control 
tJ of a two-component regulatory system and which includes a 

20 virulence attenuating mutation which does not result in 
constitutive expression of a gene under the control of 
the two-component regulatory system. In preferred 
embodiments the bacterial cell includes a mutation in a 
component of the two-component regulatory system. 

25 In preferred embodiments the bacterial cell is a 

Salmonella cell which expresses a phoP regulatory region 
regulated gene constitutively (the constitutive 
expression preferably caused by a mutation, preferably a 
non-revertible mutation, e.g., a deletion in the phoP 

30 regulatory region, e.g., a mutation in the phoQ or phoB 
gene, e.g., phoP c ) , and which includes a, virulence 
attenuating mutation, preferably a non-revertible 
mutation, e.g., a deletion, preferably in an aromatic 
amino acid synthetic gene, e.g., an aro gene, or in a 

35 phoP regulatory region regulated gene, e.g., a prg or pag 
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gene, e.g., page which does not result in the 
constitutive expression of a gene under the control of 
the phoP regulatory region. 

In another aspect, the invention features a 
5 bacterial cell, or a substantially purified preparation 
thereof, e.g. , a Salmonella cell, e.g., a S. typM cell, 
an 5. enteritidis typhimurium or a S. cholerae-suis cell, 
including a virulence attenuating mutation in a gene 
regulated by a two-component regulatory system. In 

10 „pref erred embodiments the virulence attenuating mutation 
is in a phoP regulatory region re 
prg or pag gene, e.g., page. 

In preferred embodiments the bacterial cell 
includes a second mutation, e.g., in an aromatic amino 

15 acid synthetic gene, e.g., an aro gene, in a phoP 

regulatory region gene, e.g., the phoP or phoQ genes, or 
in a phoP regulating region regulated gene, e.g., a prg 
or a pag gene, e.g., page, which attenuates virulence but 
which does not result in constitutive expression of a 

20 phoP regulatory region regulated gene. 

The invention also features a live Salmonella 
cell, or a substantially purified preparation thereof, 
e.g., aS. typhi , S. enter iditis typhimurium; or 
5. cholerae-suis cell, in which there is inserted into a 

25 virulence gene, e.g., a gene in the phoP regulating 

region, or a phoP regulating region regulated gene, e.g., 
a prg or a pag locus, e.g., page, a gene encoding a 
heterologous protein, or a regulatory element thereof. 

In preferred embodiments the live Salmonella cell 

30 carries a second mutation, e.g., an aro mutation, e.g., 
an aroA mutation, e.g., aroA" or aroADEL407, that 
attenuates virulence. 

In preferred embodiments the DNA encoding a 
heterologous protein is under the control of an 

35 environmentally regulated promoter. In other preferred 
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embodiments the live Salmonella cell further includes a 
DNA sequence encoding T7 polymerase under the control of 
an environmentally regulated promoter and a T7 
transcriptionally sensitive promoter, the T7 
5 transcriptionally sensitive promoter controlling the 
. expression of the heterologous antigen. 

The invention also features a vector capable of 
integrating into the chromosome of Salmonella including: 
a first DNA sequence encoding a heterologous protein; a 

10 second (optional) DNA sequence encoding a marker e.g., a 

selective marker/ e.g. 7 a gene- that-conf ers-resistance .... 

for a heavy metal resistance or a gene that compliments 
an aurotrophic mutation carried by the strain to be 
transformed; and a third DNA sequence, e.g., a phoP 

15 regulon encoded gene, e.g., a prg or a pag locus, e.g., 
page, encoding a product necessary for virulence, the 
third DNA sequence being mutationally inactivated. 

In other preferred embodiments: the first DNA 
sequence is disposed on the vector so as to mutationally 

20 inactivate the third DNA sequence; the vector cannot 
replicate in a wild- type Salmonella strain; the 
heterologous protein is under the control of an 
environmentally regulated promoter; and the vector 
further includes a DNA sequence encoding T7 polymerase 

25 under the control of an environmentally regulated 

promoter and a T7 transcriptionally sensitive promoter, 
the T7 transcriptionally sensitive promoter controlling 
the expression of the heterologous antigen. 

In another aspect the invention includes a method 

30 of vaccinating an animal, e.g., a mammal, e.g., a human, 
against a disease caused by a bacterium, e.g., 
Salmonella, including administering a vaccine of the 
invention. 

The invention also includes a vector including DNA 
35 which encodes the page gene product; a cell transformed 
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with the vector; a method of producing the page gene 
product including culturing the transformed cell and 
purifying the page gene product from the cell or culture 
medium; and a purified preparation of the page gene 
5 product. 

In another aspect the invention includes a method 
of detecting the presence of Salmonella in a sample 
including contacting the sample with page encoding DNA 
and detecting the hybridization of the page encoding DNA 

10.--— to nucleic_acid _in the sample. 

In another aspect the invention features a method'" 
of attenuating the virulence of a bacterium, the 
bacterium including a two-component regulatory system, 
including causing a gene under the control of the two- 

15 component system to be expressed const itutively. In 

preferred embodiments the bacterium is Salmonella, e.g., 
S. typhi, S. enteritidis typhimurium, or S. cholerae^ 
suis, and the two-component system is the phoP regulatory 
region. 

20 Two-component regulatory system, as used herein, 

'. refers to a bacterial regulatory system that controls the 
expression of multiple proteins in response to 
environmental signals. The two-components referred to in 
the term are a sensor, which may, e.g., sense an 

25 environmental parameter and in response thereto promote 
the activation, e.g. by promoting the phosphorylation, of 
the second component, the activator. The activator 
affects the expression of genes under the control of the 
two-component system. A two-component system can 

30 include, e.g., a histidine protein kinase and a 

phosphorylated response regulator, as is seen in both 
gram positive and gram negative bacteria. In E. coli, 
e.g., 10 kinases and 11 response regulators have been 
identified. They control chemotaxis, nitrogen 

35 regulation, phosphate regulation, osmoregulation, 
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sporulation, and many ither cellular functions, Stock et 
al., 1989 Microbiol. Rev. 52:450-490, hereby incorporated 
by reference. A two-component system also controls the 
virulence of Agrobacterium tumefasciens plant tumor . 
5 formation, Leroux et al. EMBO J 6:849-856, hereby 

incorporated by reference). Similar virulence regulators 
are involved in the virulence of Bordetella pertussis 
Arico et al., 1989, Proc. Natl. Acad. Sci. USA 86:6671- 
6675, hereby incorporated by reference, and Shigella 

10 flexneri, Bernardini et al., 1990, J. Bact. 172 :6274- 

_ -6281, -hereby- incorporated -by- reference. - ^ - •• ' - 

Environmentally regulated, as used herein refers 
to a pattern of expression wherein the expression of a 
gene in a cell depends on the levels of some 

15 characteristic or component of the environment in which 
the cell resides. Examples include promoters in 
biosynthetic pathways which are turned on or of f by the 
level of a specific component or components, e.g., iron, 
temperature responsive promoters, or promoters which are 

20 expressed more actively in specific cellular 

compartments, e.g., in macrophages or vacuoles. 

A vaccine, as used herein, is a preparation 
including materials that evoke a desired . biological 
response, e.g., an immune response, in combination with a 

25 suitable carrier. The vaccine may include live organism, 
in which case it is usually administered orally, or 
killed organisms or components thereof, in which case it 
is usually administered perinterally . The cells used for 
the vaccine of the invention are preferably alive and 

30 thus capable of colonizing the intestines of the 
inoculated animal. 

A mutation, as used herein, is any change (in 
comparison with the appropriate parental strain) in the 
DNA sequence of an organism. These changes can arise 

35 e.g., spontaneously, by chemical, energy e.g., X-ray, or 
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other forms of mutagenesis, by genetic engineering, or as 
a result of mating or other forms of exchange of genetic 
information. Mutations include e.g., base changes, 
deletions, insertions, inversions, translocations or 
5 duplications. 

A mutation attenuates virulence if, as a result of 
the mutation, the level of virulence of the mutant cell 
is decreased in comparison with the level in a cell of 
the parental strain, as measured by (a) a significant 

10 (e .g.. ,.. .at_J.east^ 50%) decrease in virulence in the mutant 
strain compared to the parental strain, or (b) a 
significant (e.g., at least 50%) decrease in the amount 
of the polypeptide identified as the virulence factor in 
the mutant strain compared to the parental strain. 

15 a non-revertible mutation, as used herein, is a 

mutation which cannot revert by a single base pair 
change, e.g., deletion or insertion mutations and 
mutations that include more than one lesion, e.g.,. a 
mutation composed of two separate point mutations. 

20 The phoP regulatory region, as used herein, is a 

two-component regulatory system that controls the 
expression of pag and prg genes. It includes the phoP 
locus and the phoQ locus. 

phoP regulatory region regulated genes, as used 

25 herein, refer to genes such as pag and prg genes. 

pag, as used herein, refers to a gene which is 
positively regulated by the phoP regulon. 

prg, as used herein, refers to a gene which is 
negatively regulated by the phoP regulon. 

30 An aromatic amino acid synthetic gene, as used 

herein, is a gene which encodes an enzyme which catalyzes 
a step in the synthesis of an aromatic amino acid. aroA, 
aroC, and aroD are examples of such genes in Salmonella. 
Mutations in these genes can attenuate virulence without 

35 the total loss of immunogenicity. 
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Abnormal expressions , as used herein, means 
expression which is higher or lower than that seen in 
wild type. 

Heterologous protein, as used herein, is a protein 
5 that in wild type, is not expressed or is expressed from 
a different chromosomal site, e.g., a heterologous 
protein is one encoded by a gene that has been inserted 
into a second gene. 

Virulence gene, as used herein, is a gene the 

10 inactivation of which results in a Salmonella cell with 

_ _ less virulence . than that of a similar. Salmonella cell in 

which the gene is not inactivated. Examples include the 
phoP and page genes . 

A marker, as used herein, is gene product the 

15 presence of which is easily determined, e.g., a gene 
product that confers resistance to a heavy metal or a 
gene product which allows or inhibits growth under a 
given set of conditions. 

Purified preparation, as used herein, is a 

20 preparation, e.g. , of a protein, which is purified from 
the proteins, lipids, and other material with which it is 
associated. The preparation is preferably at least 2-10 
fold purified. 

Constitutive expression, as used herein, refers to 

25 gene expression which is modulated or regulated to a 

lesser extent than the expression of the same gene in an 
appropriate control strain, e.g. , a parental or in wild- 
type strain. For example, if a gene is normally 
repressed under a first set of conditions and derepressed 

30 under a second set of conditions constitutive expression 
would be expression at the same level, e.g., the 
repressed level, the derepressed level, or an 
intermediate level, regardless of conditions. Partial 
constitutive expression is included within the definition 

35 of constitutive expression and occurs when the difference 
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between two levels of expression is reduced in comparison 
in what is seen in an appropriate control strain, e.g., a 
wild-type or parental strain. 

A substantially purified preparation of a 
5 bacterial cell is a preparation of cetlls wherein 

contaminating cells without the desired mutant genotype 
constitute less than 10%, preferably less than 1%, and 
more preferably less than 0.1% of the total number of 
cells in the preparation. 

10 The invention allows for the attenuation of 

- "virulence -of -bacteria and of. vaccina, that include 

bacteria, especially vaccines that include live bacteria, 
by mutations in two-component regulatory systems and/or 
in genes regulated by these systems. The vaccines of the 

15 invention are highly attenuated for virulence but retain 
immunogenicity, thus they are both safe and effective. 

The vectors of the invention allow the rapid 
construction of strains containing DNA encoding 
heterologous proteins, e.g. , antigens. The heterologous 

20 protein encoding DNA is chromosomally integrated, and 
thus stable, unlike plasmid systems which are dependent 
on antibiotic resistance or other selection pressure for 
stability. Live Salmonella cells of the invention in 
which the expression of heterologous protein is under the 

25 control of an environmentally responsive promoter do not 
express the heterologous protein at times when such 
expression would be undesirable e.g., during culture, 
vaccine preparation, or storage, contributing to the 
viability of the cells, but when administered to humans 

30 of animals, express large amounts of the protein. This 
is desirable because high expression of many heterologous 
proteins in Salmonella can be associated with toxicity to 
the bacterium. The use of only a single integrated copy 
of the DNA encoding the heterologous protein also 

35 contributes to minimal expression of the heterologous 
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protein at times when xpression is not desired. In 
embodiments where a virulence gene, e.g., the page gene, 
contains the site of integration for the DNA encoding the 
heterologous protein the virulence of the organism is 
5 attenuated. 

Other features and advantages of the invention 
will be apparent from the following description of the 
preferred embodiments and from the claims. 

Description of the Preferred Embodiments 
10 The drawings will first be described. 

Drawings ^ 

Fig. 1 is a graph of the survival of Salmonella 
strains within macrophages. 

Fig. 2 is a map of the restriction endonuclease 
15 sites of the page locus. 

Fig. 3 ±s i a map of the DNA sequence of the pag C 
region (Sequence ID No. 1). 
Strain Deposit 

PhoP c strain CS022 (described below) has been 
20 , deposited with the American Type Culture Collection 

(Rockville, MD) and has received ATCC designation 
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Constitutive Expression of the Pho P Reaulon Attenuates 
salmonella Virulence and Sur vival within Macrophages 

The phoP constitutive allele (PhoP G ) , pho-24, 
results in derepression of pag loci- Using diethyl 
5 sulfate mutagenesis of S. typhimurium LT-2> Ames and co- 
workers isolated strain TA2367 pho-24 (all strains, 
materials, and methods referred to in this section are 
described below) , which contained a phoP locus mutation 
that resulted in constitutive production of acid 

10 phosphatase in rich media, Kier et al. , 1979, J. 

. - -Bacteriol.-, ,138:155, hereby ir^9^or a ;ted *y 5£ference. 

This phoP-regulated acid phosphatase is encoded by the 
phoN gene, a pag locus, Kier et al. , 1979, supra, Miller 
et al., 1989, supra. To analyze whether the pho-24 

15 allele increased the expression of other pagr loci the 
effect of the pho-24 allele on the expression of other 
pag loci recently identified as transcriptional (e.g., 
pagA and pagrB) and translational (e.g. , page) fusion 
proteins that required phoP and phoQ for expression, 

20 Miller et al., 1989, supra, was determined, pag gene 
fusion strains, isogenic except for the pho-24 allele, 
were constructed and assayed for fusion protein activity. 
PhoP c derivatives of the pagA: :Mu dJ_ and pagB: :Mu dJ 
strains produced 480 and 980 U, respectively, of 

25. galactosidase in rich medium, an increase of 9- to 10- 
fold over values for the fusion strains with a wild-type 
phoP locus, see Table 1. 



WO 92/11361 



PCT/US91/09604 



- 15 - 



TABLE 1. Bacterial strains and properties 



Enzyme 

strain Genotype activity Reference or 

(U) * . source 



10428 



TA2367 



CS003 



CS022 
CS023 

CS012 



CS013 



CS119 



SC024 
SC025 
SC026 

CS015 



TT13208 



Wild type 180 (A) 



pho-24 1,925 (A) 



bphoP ApurB <10 (A) 



pho-24 1,750 (A) 

pho-24 phoN2 25 (A) 
zxx: :6251Tnl0d-Cam 

pagrAl: :MU dJ 45 (B) 



pagrBl::MU dJ 120 (B) 



pagrCl: :TnphoA phoN2 85 . (C) 



zxx: :6251Tnl0d-Cam 

pagAl::Mu dJ pho-24 450 (B) 

pagBl::Mu dJ pho-24 980 (B) 

pagCl::TnphoAphd-24phoN2 385 (B) 

zxx: :6251Tnl0d-Cam 

phoP102 : :Tnl0d-Cam <10 (A) 



phoP105i :TnI0d 



<10 (A) 



ATCC; 
Miller et 
al . 1989 ,- 
supra 
Kier et 
al., 1974, 
supra 
Miller et 
al., 1989, 
supra 
This work 
This work 

Miller et 
al., 1989, 
supra 
Miller et 
al., 1989, 
supra 
Miller et 
al., 1989> 
supra 

This work 
This work 
This work 

Miller et 
al., 1989, 
supra 



a A. Acid phosphatase; B, 0-galactosidase; C, 
alkaline phosphatase. 



Gift of Ning Zhu and John Roth. 
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The pagC. :TnphoA gene fusion produced 350 U of 
alkaline phosphatase, an increase of three- to fourfold 
over that produced in - strain CS119, which is isogenic 
except for the pho-24 mutation, Miller et al., 1989, 
5 supra. These results compare with a ninefold increase in 
the acid phosphatase activity in strain CS022 on 
introduction of the pho-24 allele. Therefore, these 
available assays for pag gene expression document that 
the pho-24 mutation causes constitutive expression of pag 

10 loci other than phoN. : . •_ 

Identifications of prote in species that are ••••••• 

repressed as wel] as' activated in t he PhoP c mutant strain 
Whole-cell proteins of strain CS022 were analyzed to 
estimate the number of protein species that could be 

15 potentially regulated by the PhoP regulon. Remarkably, 
analysis by one-dimensional polyacrylamide gel 
electrophoresis of the proteins produced by strains with 
the PhoP c phenotype indicated that some protein species 
were decreased in expression when many presumptive pag 

20 gene products were fully induced by the pho-24 mutation. 
The proteins decreased in the PhoP c strain might 
represent products of genes that are repressed by the 
PhoP regulator. Genes encoding proteins decreased by the 
pho-24 allele are designated prg loci, for phoP-repressed 

25 genes. Comparison of wild-type, PhoP", and PhoP c mutant 
strain proteins shows that growth in LB medium at 37°C 
represents repressing conditions for pagr gene products 
and derepressing conditions for prg gene products. 

To estimate the total number of potentially PhoP- 

30 regulated gene products, the total cell proteins of wild- 
type and PhoP c mutant strains grown in LB were analyzed 
by two-dimensional gel electrophoresis. At least 40 
species underwent major fluctuation in expression in 
response to the pho-24 mutation. 
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Virulence defects of the PhoP c strain Remarkably , 
strains with the singl pho~24 mutation were markedly 
attenuated for virulence in mice (Table 2). The number 
of PhoP c organisms (2 x 10 s ) that killed 50% of BALB/c 
5 mice challenged (LD 50 ) by the intraperitoneal (i.p.) route 
was near that (6 x 10 5 ) of PhoP" bacteria, Miller et al., 
1989, supra. The PhoP c strains had growth comparable to 
wild-type organisms in rich and minimal media. The PhoP c 
mutants were also tested for alterations in 
10 lipopolysaccharide, which could explain the virulence 
■ -- -.—defect- observed . ^Strain. CS 02 2 had_ normal sensiti vit y tp_ ... _ 
phage P22, normal group B reactivity to antibody to O 
antigen, and a lipopolysaccharide profile identical to 
that of the parent strain, as determined by 
15 polyacrylamide gel electrophoresis and staining. 
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Since the TA2367 pho-24 strain was constructed by 
chemical mutagenesis and could hav another linked 
mutation responsible for its virulence defect revertants 
of the PhoP c were isolated to determine whether the pfto- 
5 24 allele was responsible for the attenuation of 
virulence observed. Phenotype PhoP c revertants, 
identified by the normal levels of acid phosphatase in 
rich medium, were isolated among the bacteria recovered 
from the livers of mice infected with strain CS022. Six 

10 separate phenotypic revertants, designated CS 122 to 

„ CS 1 28, ere . found to be fully virulent (LD 50 of less than „ 

20 organisms for BALB/c mice). The locus responsible for 
the reversion phenotype was mapped in all six revertants 
tested for virulence by bacteriophage P22 cotransduction 

15 and had linkage characteristics consistent with the phoP 
locus (greater than 90% linkage to purB) . These data 
indicate that these reversion mutations are not 
extragenic suppressors but are intragenic suppressors or 
true revertants of the pho-24 mutation. Thus, the, 

20 virulence defect of PhoP c mutants is probably the result 
of a single revertible mutation in the phoP locus and not 
the result of a second unrelated mutation acquired during 
mutagenesis. 

Reversion frequency of the PhoP c phenotyp e The 

25 reversion frequency of the PhoP c mutation in vivo in mice 
was investigated to assess whether reversion could reduce 
the LD 50 of this strain. The presence of the revertants 
of strain CS022 was tested for by administering 10 6 , 10 4 , 
and 10 2 challenge organisms to each of eight animals by 

30 i.p. injection. On day 7, three animals died that 

received 10 6 PhoP c organisms, on that day, the livers and 
spleens of all animals were harvested and homogenized in 
saline. After appropriate dilution, 10% of the tissue 
was plated on LB plates containing the chromogenic 

35 phosphatase substrate XP. Revertants were identified by 
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their lighter blue - colonies compared with PhoP e bacteria 
and were confirmed by quantitative acid phosphatase 
assays. An estimated 10 7 , 10 s , and 10 3 organisms per 
organ were recovered from animals at each of the three 
5 respective challenge doses. Revertants were identified 
only at the highest dose and comprised .0.5 to 1%, or 10 5 
organisms per organ, at the time of death. It is likely 
that revertants are able to compete more effectively for 
growth in these macrophage-containing organs, since 

10 strain CS022 is deficient in survival within macrophages 
(see belowT. However/ revertants were Tio^^ 
fewer than 10 5 organisms were administered in the 
challenge dose, suggesting that the reversion frequency 
must be approximately 10" 5 . The reversion rate of the 

15 PhoP c phenotype for CS022 bacteria grown in LB is in fact 
6x1 0" 4 when scored by the same colony phenotypes. The 
percentage of revertants recovered from animals near 
death suggests that pressure is applied in vivo that 
selects for revertants of the PhoP c phenotype and implies 

20 that the virulence defect observed could be much greater 
quantitatively for a strain with a nonrevertible PhoP c 
mutation. 

The PhoP c strain is deficient in surviva l within 
macrophages Because of the importance of survival within 

25 macrophages to Salmonella, virulence Fields et al., 1986, 
Proc. Natl. Acad. Sci. USA 83:5189, hereby incorporated 
by reference, PhoP c bacteria were tested for this 
property. Strain CS022 was defective in the ability to 
grow and persist in macrophages as compared with wild- 

30 type organisms (Fig. 1). In Fig. 1 the survival of 

strain CS022 (PhoP c ) (triangles) in cultured macrophages 
is compared with that of wild-type S. typhimurium ATCC 
10428 (cicles) . The experiment shown is a representative 
one. The difference between the two strains at 4 and 24 
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hours is significant (P < 0.05) . PhoP" bacteria seemed 
to have a macrophage survival defect qualitatively 
similar to that of PhoP c bacteria but survived 
consistently better by two- to threefold in side-by-side 
5 experiments. The increased recovery of organisms that 
reverted to PhoP c phenotype in mouse organs rich in 
macrophage content is consistent with the reduced 
macrophage survival of PhoP c mutants in vitro. 

Use of the PhoP c strain as a live vaccine It has 

10 been previously reported that PhoP" strains are useful as 
- live vaccines in protectingagainst mouse typhoid, Miller 
et al. , 1989, supra. The immunogenicity of PhoP c when 
used as live attenuated vaccines in mice was compared 
with the of PhoP". This was done by simultaneous 

15 determination of survival f after graded challenge doses 
with the wild-type strain ATCC 10428, in mice previously 
immunized with graded doses of the two live vaccine 
strains. CS015 phoP : :Tnl0d-Cam and CS022 pho-24, as well 
as a saline control. The results obtained (Table 2) 

20 suggest the following conclusions: (i) small i.p. doses 
of the PhoP c strain (e.g., 15 organisms) effectively 
protect mice from challenge doses as large as 5xl0 5 
bacteria (a challenge dose that represents greater than 
10 4 i.p. LD 50 s) , (ii) large doses of PhoP c organisms given 

25 orally completely protect mice from an oral challenge 
consisting of 5xl0 7 wild-type bacteria (over 200 oral 
wild-type LD 5Q s) and (iii) by comparison, a large dose of 
PhoP" organisms (5xl0 5 ) does not provide similar 
protection. The reversion of the PhoP c mutation in vivo 

30 somewhat complicates the analysis of the use of these 

strains as vaccines, since revertants of the CS022 strain 
(i.e., wild-type cells) could increase immunogenicity). 
However, we were unable to identify revertants by 
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examining 10% of the available spleen and liver tissue 
from those mice that received 10* or fewer organisms, 

strains. Materials and Methods The strains, 
materials, and methods used in the PhoP regulon work 
5 described above are as follows. 

American Type Culture Collection (ATCC) strain 
14028, a smooth virulent strain of S. typhimurium, was 
the parent strain for all virulence studies. Strain 
TT13208 was a gift from Nang Zhu and John Roth. Strain 

10 TA2367 was a generous gift of Gigi Stortz and Bruce Ames, 

Kief et al . , 1979 , supra . - Bacteriophage- P22HT int. was 

used in transductional crosses to construct strains 
isogenic except for phoP locus mutations, Davis et al., 
1980, Advanced Bacterial Genetics, p. 78, 87. Cold 

15 Spring Harbor Laboratory, Cold Spring Harbor, NY, hereby 
incorporated by reference. Luria broth was used as rich 
medium, and minimal medium was M9, Davis et al., 1980, 
supra. The chromogenic phosphatase substrate 5-bromo-4- 
chloro-3indolyl phosphate (XP) was used to qualitatively 

20 access acid and alkaline phosphatase production in solid 
media. 

Derivatives of S. typhimurium ATCC 10428 with the 
pho-24 mutation were constructed by use of strain TA2367 
as a donor of the purB gene in a P22 transductional cross 

25 with strain CS003 AphoP ApurB, Miller et al., 1989, 
supra.. Colonies were then selected for the ability to 
grow on minimal medium. A transductant designated CS022 
(phenotype PhoP c ) that synthesized 1,750 U of acid 
phosphatase in rich medium (a ninefold increase over the 

30 wild -type level in rich medium) was used in further 
studies. 

Derivatives of strains CS022 and CS023 pho-24 
phoN2 zxxt :6251Tnl0d-Cam, and acid phosphatase-negative 
derivative of CS022, containing pag gene fusions were 
35 constructed by bacteriophage P22 transductional crosses, 
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using selection of TnphoA- or Mu dJ-encoded kanamycin 
resistance. Strains were checked for the intact pag gene 
fusion by demonstration of appropriate loss of fusion 
protein activity on introduction of a phoPlOSi :TnlOd or 
5 phoP102: :Tn20d- Cam allele. 

Assays of acid phosphatase, alkaline phosphatase, 
and /?-galactosidase were performed as previously 
described, Miller et al. , 1989, supra and are reported in 
units as defined in Miller, 1972, Experiments in 

10 molecular genetics, p. 352-355, Cold Spring Harbor 

Laboratory ,. Gold- Spring -Harbor , - NY , hereby incorporated^- 

by reference. 

In the mouse virulence and vaccination studies 
bacteria grown overnight in Luria broth were washed and 

15 diluted in normal saline. The wild-type parent strain of 
CS022 (ATCC 10428) was used for all live vaccine 
challenge studies. This strain has a 50% lethal dose 
(LD 50 ) for naive adult BALB/c mice of less than 20 
organisms when administered by intraperitoneal (i.p.) , : 

20 injection and 5xl0 4 when administered orally in NaHC0 3 . 
Mice were purchased from Charles River Breeding 
Laboratories, Inc. (Wilmington, Mass.) and were 5 to 6 
weeks of age at initial challenge. All i.p. inoculations 
were performed as previously described, Miller et al., 

25 1989, supra. Oral challenge experiments were performed 
with bacteria grown in LB broth and concentrated by 
centrifugation. The bacteria were resuspended in 0.1 M 
NaHC0 3 to neutralize stomach acid, and administered as a 
0.5-ml bolus to animals under ether anesthesia. Colony 

30 counts were performed to accurately access the number of 
organisms administered. All challenge experiments were 
performed 1 month after i.p. inoculation and 6 weeks 
after oral challenge. Challenge inocula were 
administered by the same route as vaccinations. The care 

35 of all animals was under institutional guidelines as set 
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by the animal are committees at the Massachusetts General 
Hospital and Harvard Medical School. 

Protein electrophoresis was performed as follows. 
One-dimensional protein gel electrophoresis was performed 
by the method of Laemmli, 1970, Nature 222:680, hereby 
incorporated by reference, on whole-cell protein extracts 
of stationary-phase cells grown overnight in Luria broth. 
The gels were fixed and stained with Coomassie brilliant 
blue R250 in 10% acetic acid-10% methanol. Two- 
dimensional protein gel electrophoresis was performed by 
method- of O'Farrell,. 1975,_ J, Biol... Chem. 250:4007, 
hereby incorporated by reference, on the same whole-cell 
extracts. Isoelectric focusing using 1-5% P H 3.5 to 10 
ampholines (LKB Instruments, Baltimore, Md.) was carried 
out for 9,600 V h (700 V for 13 h 45 min) . The final 
tube gel pH gradient extended from P H 4.1 to pH 8.1 as 
measured by a surface pH electrode (BioRad Laboratories, 
Richmond, Calif.) and colored acetylated cytochrome pi 
markers (Calbiochem-Behring, La Jolla, Calif.) run in an 
adjacent tube. The slab gels were silver stained, Merril 
et al., 1984, Methods Enzymol. 104:441, hereby 
incorporated by reference. 

In the macrophage survival assays experiments were 
performed as previously described, Miller et al., 1989, 
supra, by the method of Buchmeier et al., 1989, Infect. 
Immun. 57:1, hereby incorporated by reference, as 
modified from the method of Lissner et al, 1983, J. 
Immunol. 131:3006, hereby incorporated by reference. 
Stationary-phase cells were opsonized for 30 min in 
30 normal mouse serum before exposure to the cultured bone 
marrow-derived macrophages harvested from BALB/c mice. 
One hour after infection, gentamicin sulfate (8 ng/nl) 
was added to kill extracellular bacteria. All time 
points were done in triplicate and repeated on three 
35 separate occasions. 
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PhoP c Mutant Strains Are More Effective as Live Vaccines 

PhoP c mutant S. typhimurium are very effective 
when used as a live vaccine against mouse typhoid fever 
and are superior to PhoP~ bacteria • As few a 15 PhoP c 
5 bacteria protect mice against 10 5 LD 50 (lethal doses 50%) 
of wild type organisms by the intraperitoneal route 
(Table 3) . This suggests that pag gene products are 
important antigens for protective immunity against mouse 
typhoid. Preliminary results have documented that 

10 antigens recognized by serum of chronic typhoid carriers 

~ fecognizes "some""phoP*-regulated~gene-products-of 5- typhi 

If protective antigens are only expressed within the 
host, then dead vaccines only grown in rich media may not 
induce an immune response against these proteins. 

15 The use of different S. typhimurium dead vaccine 

preparations containing different mutations in the phoP 
regulon was evaluated. As can be seen in Table 3 no dead 
cell preparations (even those containing mixtures of 
PhoP* and PhoP c bacteria) are as effective vaccines as are 

20 live bacteria. This suggests, that there are other 

properties of live vaccines that increase immunogenicity 
or that important non-PhoP-regulated antigens are not in 
these preparations. The only protection observed in any 
animals studied was at the lowest challenge dose for 

25 those immunized with PhoP c bacteria. This further 
suggests that phoP activated genes are important 
protective antigens. 



WO 92/11361 



PCT/US91/09604 



- 16 - 



m 
G) 

<d 



0) 
C 

o 

0 

<d 
> 

td 

0) 
T3 



(d 

0) 

tn 

w 
c 
o 

•H 

-P 
rd 
-P 

i 

c 
o 

r-i 

0) 

n 

ft 
0 

x: 
a, 

.C 

<P 

-rl 

H 
H 
O 

c 
o 
e 

H 
CO 



in 
E 
to 

c 

CP 

Otn 
o 

*d 
iH 
•H 



(Dm 
10 O 
O H 

X 

0) 

CJ>V£> 

C 

0) 



u 



c 
o 

•H O 

flfl >« 

•H O 

O C 
O 0) 

> a 



(d 



in 



n 



c\ n ^ 

* — r-i H 



cn 



CO o ^ 



ft 

o 
ft 



o 
& 
>i 

I u 

TJ ft ft 
iH O O 

£ ft ft 

00 
CM 



I 

ft 
O 

.ft 



m 

H 
O 

, CO 

o 



o 

din m m 

0) U rl W W 

c V o o o 

O H CO CO CO 

s < a a u 



E 

O 
I 

O 

c 



O 

o 

ft 



4J C 



a 

a u 
M O 

01 © 
x: Eh 



<6 © 
M «J 

Q) O 
4J *H 

O *o 
<d c 

© © 
rH O 

C 5 

.rl 

rH 

(0 © 

E A 

u -M 

o 

CO 

eo 

o a 

x CD 
in -ri 
c 

•rl H 

? 0 

* a 

4J ft 
u >i 

«j +> 

rH 

*! 

^* 

o © 

-rl CP 

? c 
+> © 

•d iH 

H £ 

~4 O 
C 

3 © 

g © 

© © 

VI u 

i 

o © * 

•c a c 

n -p 

0 S 

oj ■ c 

cm cn *h 

O iJ o 

to a o 

o oa * 
c 

0 

u 

03 

5 



c 

3 

CD 
>l 

•s 



Q 

CD 
© 
U 
ft 
X 

© 



c 

CD 
© 

c 
© 

© 

-p 

> 

43 
O 
«J 

I 

o 



© 

1 

c 

c 
td 

i 

c 

© 

c 
© 

rH • © 

ta v > 

A © -H 

U CD 4J 

C 3 

H © «U 

© rH -H 

4J rH 

(M <f © 

o 

© 

u u 
o © 



c. 
o 

-rl 

CD 
CO 
© 
U 

a 
© 

© 

15 

rH 
3 

© 

c 

3 



CO 

© ^ 

CO 0 

© -r| 

x: 4J 
© 

u ~ 

a) « 



© 

© 

0 4£ 

O 

O -rl 

U T) 

0 c 
> 



D O 

ft 

0 



e 
© 

© 

© 

e 
ca 
© 

14 

© 

H 

Oi 
O 



c 

0 

-rl 

a 

CD 

© 
ft 

c 

CO* 

o 
© 

C 
© 

© 

> 

-rl 
4J 
O 

<d 

i 

% 

ft 



c 

0 

-rl 
CD 

- CD 
© 
U 

a 
x 
© 

o 

M 
O 
« 



CO 

© 

• «f 
a u 

© -rl 

c "O 
© c 

D> -rl 

© CI, 
CD O 
O « 

© a 

ft 
© 
u 



O 

•s 



WO 92/11361 



PCT/US91/09604 



aroK PhoP Reaulon Doubl e Mutant Strains 

Recent efforts by Stocker, Levine, and colleagues 
have focused on the use of strains with auxotrophic 
mutations in aromatic amino acid and purine pathways as 
5 live vaccines, Hoseith et al., 1981, Nature £91:238, 

hereby incorporated by reference, Stocker, 1988, Vaccine 
6:141, hereby incorporated by reference, and Levine et 
al., 1987, J. Clin, Invest. 79:888, hereby incorporated 
by reference. Purine mutations were found to be too 

10 attenuating for immunogenicity, likely because purines 

are not available to the organism within the mammalian 
host, Sigwart et al., 1989, Infect. Immun. 57:1858, 
hereby incorporated by reference. Because auxotrophic 
mutations may be complemented by homologous recombination 

15 events with wild type copies donated from environmental 
organisms or by acquiring the needed metabolite within 
the hosit, it would seem prudent for live vaccines to 
contain a second attenuating mutation in a different 
virulence mechanism, (i.e., not just a second mutation in 

20 the same metabolic pathway) . Additionally, in mice the 
aroA mutants have some residual virulence. Various 
strains with aroA mutations combined with phoP regulon 
mutations were investigated for virulence attenuation and 
immunogenicity. Table 4 demonstrates that a PhoP"" or 

25 PhoP c mutation further attenuates aroA mutant S. 

typhimurium by at least 100-fold and that, at least at 
high levels of vaccinating organisms, immunogenicity is 
retained. Strains with both a page" and phoP c phenotype 
are also further attenuated than either mutation alone. 

30 Therefore, phoP regulon mutations may increase the safety 
of aroA live vaccine preparations. 



WO 92/11361 



PCT/US91/09604 



-IE- 



< 

rH 



to 
c 
o 

•H 
-P 
<0 

! 

c 
o 

•H 

CP 
<D 
H 

04 

o 
04 
>1 

,Q 

W 
-P 

c 

«J 
-p 

s 

O 

c 
o 

•H 

-P 
(0 

g 

o 
-p 

+3 



CO 

e 
o 

-p 



+> 

a 

OS 
H 
M 

O 

§ 

to 
o 

-(fl 

i 

C ■ 
CP 
•H 

u 

CO 

>^ 

4-1 * 
O ^ 

to tn 
m e 

o W 

>.K 
•H C 
> CO 

U CP 
CO o 



o 

o 



0% 

o 



CO 

o 



O 



VD 
O 
rH 



0) 

p4 

>1 

■P 
O 
C 
Q) 



co 
-p 

CO 



vo vo vo vo vo vo vo 
vo vo vo vo vo vo vo 



VO V0 VO VD VO VD VO 
O O H 03 CM H O 

, vo VO vo vo vo vo vo 

O O VO VO VO VD O 
VO VO VO VO VO VD VO 

h*h*vo vo vo vo *r 



VO VO VO VD VO VD VO 
VO* VO* VO* VD VO VD VO 



1 « « 

Wu 04i Oju 

-H Pn O PLi O CU 

X O O ^ O 

.C 04 ,C 04 ,C 

H 0* P4 & 

I TJ 1 VD I VD I 

O 0 O O v> 
^ ^ M H k| J co , 
CO fO CO CO «0 CO 04 



*r vo cm m if) vo vo 

ONCJNHHCJ 

onn n n n o 
CO t-3 CO CO CO CO CO 

uwuuouo 



WO 92/11361 



PCT/US91/09604 



W 

c 
o 

•H 

-P 

■p 

1 

c 
o 

H 

o» 

0 

u 

0 

x: 
-p 

> 



Q> 
O 

CO 
<w 

o 

>1 

o 

id 

u 

•H 
U4 

<w 
<D 

CD 
> 
•H 

-P 
U 
Q) 
,-P 
O 

u 











(0 




•H 




c 
















0 




fl) 




a 




>i 




-p 








H 




•H 




> 




~_0- ^ 




w 




a) 




to 


H 


0 




13 




0 


in 






c 




0 




rH 




rH 




<0 




.C 




0 




*W 


in 


O 


0 


H 


w 


X 






0 


in 


> 






> 




M 








CO 





3 
o 
o 
c 

H 



0) 

>i 
-P 

o 
c 

0) 
XI 

04 



c 

•H 
(0 

u 

<P 
CO 



in in 
in 



mm 
in in 



*t in in in in m in 
in in in in in in 



oooooooo 

HrHHHHHHH 



00 
O 
H 



u 

(flu 0<O 

re o £ o £ o £ 

i3 & je« A4 ,c 04 

H Qi CU P< 

<U H H H 

I *0 | VO I vo I \o 

O O O n O ^ O n 



04 

o 
x: 

VD 
Cm 
ro 
H 



<0 CO 



^\ocMn<N<ocM«r>invo 
onnnnnnnnn 

COhICOCOCOCOCOCOCOCO 

uwoouuouou 





* 

c 




0 




"H 




A) 




<d 




r-4 








U 




0 




c 


— 


*H- 




i— 1 








0) 




c 




0 




-U 




-H 




M 




0) 




Q< 




(0 




U 








c 












M 




0) 








a 




•u 




a 


a 


0) 


o> 


€ 


c 


•H 


O 


U 


rH 


0) 


rH 


a 


cd 


X 




0) 


o 






u 


0 


0) 


0 




W4 




£ 


0 




iH 


O 


«-l 






H 




0) 


c 




0 


1 


-H 




-P 


c 


m 




rH 


0 


s 


■P 


u 




o 


a 


c 



E 
O 

o 

rH 

c 



E 
*o 

O 
I 

o 

H 
iH 

in 

<N 



14 
O 
>* 
*H 
> 
H 
3 
09 

*H 

0 



to 



-H «0 

•P C 

fit M 

* * 



CN 

1 • 

o 

rC CM 

a i 

o 

O JZ 

rH A 

c 

O 
in H 

S § 

o o 

M M 



O 



O vo 
CD 

O -4 

c 

o 

^* a 
in pq 
in a 



a. 
o 
x ^> 

* I 
O 



O 

c 



ii n ii n 



vo cm n in vo vo 

O <N1 CM CN rH rH CM 

o n ro n nj n O 

tO iJ W W W CO 05 

O CO U O O O O 



WO 92/11361 



PCT/US91/09604 



- 30 - 



10 



c„ 7 mom? 7 la +YP hi ° hoP * pr T" 1on Mutations 

The phoP regulon is at least partially conserved 
in S. typhi DNA hybridization studies as well as P22 
bacteriophage transductional crosses have documented that 
the phoP, phoQ, and page genes appear highly conserved 
between S. typhi and S. typhimurium nutations in these 
genes in S. typhi have been made. 

^.nnP.Ua v^^ines aq ^livery Systems for 

Heterologous A ntigens 

The vector used in the vaccine delivery .system is 

a derivative of pJM703.1 described in Miller et.al., 
1988, J. Bact. 170:2575, hereby incorporated by 
reference. This vector is an R6K derivative with a 
deletion in the pir gene. R6K derivatives require the 
15 protein product of the pir gene to replicate. E. colx 
that contain the pir gene present as a lambda 
bacteriophage prophage can support the replication of 
this vector. Cells that do not contain the pir gene wxll 
not support the replication of the vector as a plasmid. 
20 This vector also contains the mob region of RP4 which 

will allow mobilization into other gram negative bacteria 
by mating from E. coli strains such as SM10 lambda pir, 
which can provide the mobilization function in trans. 

The page region is shown in Figs. 2 and 3. Fig. 2 
25 shows the restriction endonuclease sites of the pagrC 
locus. The heavy bar indicates page coding sequence. 
The TnphoA insertion is indicated by a inverted triangle. 
The direction of transcription is indicated by the arrow 
and is left to right. The numbers indicate the location 
30 of endonuclease sites, in number of base pairs, relative 
to the start codon of predicted page translation with 
positive numbers indicating location downstream of the 
start codon and negative numbers indicating location 
upstream of the start codon. A is heel, B is Bgrll, C is 
35 ClaH, D is Pral, E is EcoKI, H is ffpal, N is Nrul, P is 
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PstI, S is Sspl, T is StuI, U is PvuII, V is EcoRV, and 
II is BgllZ. Fig. 3 shows the DNA sequence (Sequence 
I.D. No. 1) and translation of pagC: zTnphoA. The heavy 
underlined sequence indicates a potential ribosomal 
5 binding site. The single and double light underlines 
Indicate sequences in which primers were constructed 
complementary to these nucleotides for primer extension 
of RNA analysis. The asterix indicates the approximate 
start of transcription. The arrow indicates the 
10 direction of transcription. The boxed sequences indicate 

• a- region- that may -function -in polymerase- binding and— - 

recognition. The inverted triangle is the site of the 
sequenced TnphoA insertion junction. The arrow indicates 
a potential site for single sequence cleavage. 
15 3 kilobases of DNA containing the page gene (from 

the PstI restriction endonuclease site 1500 nucleotides 
5» to the start of pagC translation to the EcoRI 
restriction endonuclease site 1585 nucleotides downstream 
of page translation termination) were inserted into the 
20 pJM703.1 derivative discussed above. The pagC sequence 
. from the Clal restriction endonuclease site was deleted 
(490 nucleotides) and replaced with a synthetic 
oligonucleotide polylinker that creates unique 
restriction endonuclease sites. DNA encoding one or more 
25 heterologous proteins, e.g., an antigen, can be inserted 
into this site. This creates a vector which allows the 
insertion of multiple foreign genes into the DNA 
surrounding pagC. 

The vector can be mobilized into salmonella by 
30 mating or any other delivery system, e.g., heat shock, 
bacteriophage transduction or electroporation. .Since it 
can not replicate, the vector can only insert into 
Salmonella by site specific recombination with the 
homologous DNA on both sides of the pagC gene. This will 
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disrupt and inactivate the native pagrC locus and replace 
it with the disrupted page DNA carried on the vector. 

Such recombination events can be identified by 
marker exchange and selective media if the foreign DNA 
5 inserted into the page locus confers a growth advantage. 
- The insertion of antibiotic resistance genes for 
selection is less desirable as this could allow an 
increase in antibiotic resistance in the natural 
population of bacteria. Genes which confer resistance to 
10 substances other than antibiotics e.g., to heavy metals 
~ or arsenic (for mercury resistance, see Nucifora et al.,._. 
1989, J. Bact., 171M241-4247, hereby incorporated by 
reference), can be used to identify transf ormants . 
Alternatively, selection can be performed using a 
15 Salmonella recipient strain that carries an auxotrophic 
mutation in a metabolic pathway and a vector that carries 
DNA that compliments the auxotrophic mutation. Many 
* Salmonella live vaccine prototypes contain mutations in 
histidine or purine pathways thus complementation of 

20 these metabolic auxotrophies can be used to select for 
integrants. (Purine mutations specifically have been 
shown to be too attenuated for use in man.) Further 
proof of marker exchange can be documented by loss of the 
ampicillin resistance (carried on the plasmid backbone) 

25 or by blot hybridization analysis. 

A gene useful for selection can be cloned, by 
complementation of a vaccine strain with a metabolic 
auxotrophy. Specific examples include the cloning of the 
DNA encoding both pux-B and phoP by complementation of a 

30 strain deleted for function of both these genes. 

Salmonella gene libraries have been constructed in a 
pLAFR cosmid vector (Frindberg et al., 1984, Anal. 
Biochem. 137:266-267, hereby incorporated by reference) 
by methods known to those skilled in the art. pLAFR 

35 cosmids are broad host range plasmids which can be 
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mobilized into Salmonella from E. coli . An ntire bank 
of such strains can be mobilized into Salmonella vaccine 
strains and selected for complementation of an 
auxotrophic defect (e.g., in the case of purS growth on 
5 media without adenine) . The DNA able to complement this 
defect is then identified and can be cloned into the 
antigen delivery vector. 

As discussed above heterologous genes can be 
inserted into the polylinker that is inserted into the 

10 page sequence of the vector. The heterologous genes can 
- be- under- the-control-of- any ..of ..numerous, .environmentally-- - 
regulated promotor systems which can be expressed in the 
host and shut off in the laboratory. Because the 
expression of foreign proteins, especially membrane 

15 proteins (as are most important antigens), is frequently 
toxic to the. bacterium, the use of environmentally 
regulated promoters that would be expressed in mammalian 
tissues at high levels, but which could be grown in the 
laboratory without expression of heterologous antigens 

20 would be very desirable. Additionally, high expression* 
of antigens in host tissues may result in increased 
attenuation of the organism by diverting the metabolic 
fuel of the organism to the synthesis of heterologous 
proteins. If foreign antigens are specifically expressed 

25 in host phagocytic cells this may increase the immune 
response to these proteins as these are the cells 
responsible for processing antigens. 

The promoter systems likely to be useful include 
those nutritionally regulated promoter systems for which 

30 it has been demonstrated that a specific nutrient is not 
available to bacteria in mammalian hosts. Purines, 
Sigwart et al., 1989, Infect. Immun., 52:1858 and iron, 
Finklestein et al., 1983, Rev. Inf ct. Dis. 5:S759, e.g., 
are not available within the host. Promoters that are 

35 iron regulated, such as the aerobactin gene promoter, as 
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well as promoters for biosynthetic genes in purine 
pathways, are thus excellent candidates for testing as 
promoters that can be shut down by growth in high 
concentrations. of these nutrients. Other useful 
environmentally regulated Salmonella promoters include 
promoters for genes which encode proteins which are 
specifically expressed within macrophages, e.g., the DnaK 
and GroEL proteins, which are increased by growth at high 
temperature, as well as some phoP activated gene 
products, Buchmeier et al., 1990, Science 218:730, hereby 
rncorpor^^ Therefore, promoters -such . as 

the page 5' controlling sequences and the better 
characterized promoters for heat shock genes, e.?,« GroEL 
and DnaK, will be expected to be activated specifically 
within the macrophage. The macrophage is the site, of 
antigen processing and the expression of heat shock genes 
in macrophages and the wide conservation of heat shock 
genes in nature may explain the immunodominance of these, 
proteins, a consensus heat shock promoter sequence is 
20 known and can be used in the vectors (Cowling et: al., . 
1985, Proc. Natl. Acad. Scf. USA 82:2679 hereby 
incorporated by reference). 

The vectors can include an environmentally 
regulated T7 polymerase amplification system to express, 
25 heterologous proteins. For example, the T7 polymerase 
gene (cloned by Stan Tabor and Charles Richardson, See 
Current Protocols in Molecular Biology ed. Ausubel et 
al., 1989, (page 3.5.1.2) John Wiley. and Sons, hereby 
incorporated by reference) under control of an iron 
30 regulated promoter, can be included on the vectors 

described above. We have inserted the aerobactin gene 
promoter of E. coll with the sequence 

CATTTCTCATTGATAATGAGAATCATTATTGACATAATTGTTATTATTTTACG 
(Sequence ID No. 2), Delorenzo et al. J. Bact. 169:2624, 
35 hereby incorporated by reference, in front of the T7 
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polymerase gene and demonstrated iron regulation of the 
gene product. This version of the vector will also 
include one or more heterologous antigens under the 
control of T7 polymerase promoters. It is well known 
5 that RNA can be synthesized from synthetic 

oligonucleotide T7 promoters and purified T7 in vitro. 
When the organism encounters low iron T7 polymerase will 
be synthesized and high expression of genes with T7 
promoters will be facilitated. 
10 The paaC gene and page Gene Product 

----- strains, materials . "and" methods " The following—- 
strains, materials, and methods were used in the cloning 
of page and in the analysis of the gene and its gene 
product. 

15 Rich media was Luria broth (LB) and minimal media 

was H9, Davis et al., 1980, supra. The construction of 
S. typhimurium strain CS119 pagCl: -.TnphoA phoN2 zxx::6251 
TnlOd-Cani was previously described, Miller et al. , 1989, 
supra. American Type Culture Collection (ATCC) S. 
20 typhimurium strain 10428 included CS018 which is isogenic 
to CS119 except for phoP105 : zTnlOd, Miller et al., 1989, 
supra, CS022 pho-24, Miller et al., 1990, J. Bacteriol. 
172:2485-2490, hereby incorporated by reference, and .,. 
CS015 phoP102 : :Tnl0d-cam, Miller et al., 1989, supra. 
25 Other wild type strains used for preparation of 

chromosomal DNA included s. typhimurium LT2 (ATCC 15277) , 
s. typhimurium Ql and S. drypool (Dr. J» Peterson U. 
Texas Medical Branch, Galveston), and Salmonella typhi 
Ty2 (Dr. Caroline Hardegree, Food and Drug 
30 Administration). pLAFR cosmids were mobilized from E. 
coli to S. typhimurium using the E. coli strain MM294 
containing pRK2013, Friedman et al., 1982, Gene 18:289- 
296, hereby incorporated by reference. Alkaline 
phosphatase (AP) activity was screened on solid media 
35 using the chromogenic phosphatase substrate 5-bromo-4- 
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chloro-3-indolyl phosphate (XP) . AP assays were 
performed as previously described, Brickman et al., 1975, 
j. Mol. Biol. 96:307-316, hereby incorporated by 
reference, and are reported in units as defined by 
Miller, Miller, 1972, supra, pp. 352-355. 

One dimensional protein gel electrophoresis was 
performed by the method of Laemmli, 1970, Nature, 
227:680-685, hereby incorporated by reference, and blot 
hybridization using antibody to AP was performed as 
-previously described, Peterson et al.^l?88, Infect. 

immun. 5J5: 2822-2829 , hereby incorp-orated" by reference, - - 

Whole cell protein extracts were prepared, from saturated 
cultures grown in LB at 37«C with aeration, by boiling 
the cells in SDS-pagE sample buffer, Laemmli, 1970, 
15 supra. Two dimensional gel electrophoresis was performed 
by the method of O'Farrell, 1975, J. Biol. Chem. 
250:4007, hereby incorporated by reference: Proteins in 
the 10% polyacrylamide slab gels were visualized by 
silver staining, Merril et al., 1984, Methods in 
Enzymology, 104:441, hereby incorporated by reference. 

Chromosomal DNA was prepared by the method of 
Mekalanos, 1983, Cell, 11:253-263, hereby incorporated by 
reference. DNA, size fractionated in agarose gels, was 
transferred to nitrocellulose (for blot hybridization) by 
the method of Southern, 1975, J. Mol. Biol. 98:503-517, 
hereby incorporated by reference. DNA probes for 
Southern hybridization analysis were radiolabeled by the 
random primer method, Frinberg et al. , 1984, supra. 
Plasmid DNA was transformed into E. coli and Salmonella 
by calcium chloride and heart shock, Mekalanos, 1983, 
supra, or by electroporation using a Genepulser apparatus 
(Biorad, Richmond, Ca.) as recommended by the 
manufacturer, Dower et al., 1988, Nucl. Acids Res. 
16:6127-6145, hereby incorporated by reference. DNA 
sequencing was performed by the dideoxy chain termination 



20 



25 
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method of Sanger et al. , 1977, Proc. Natl. Acad. Sci. 
USA, 74:5463-5467, hereby incorporated by ref rence, as 
modified for use with Sequenase (U.S. Biochemical, 
Cleveland, Ohio) . Oligonucleotides were synthesized on 
5 an Applied Biosystems Machine and used as primers for 
sequencing reactions and primer extension of RNA. 
Specif ic primers unique to the two ends of TnphoA, one of 
which corresponds to the alkaline phosphatase coding, 
sequence and the other to the right IS50 sequence, were 

10 used to sequence the junctions of the transposon 

■ •insertion.- -■ „_.__-.-, — ._ . _ ... . 

Construction of a S. typhimurium cosmid gene bank 
in pLAFR3 and screening for clones containing the wild 
type page DNA was performed as follows . DNA from S . 

15 typhimurlum strain ATCC 10428 was partially digested 
using the restriction endonuclease Sau3A arid then size 
selected on 10-40% sucrose density gradient. T4 DNA 
ligase was used to ligate chromosomal DNA of size 20-30 
kilobases into the cosmid vector pLAFR3, a derivative of 

20 pLAFRl , Friedman et al., 1982, Gene 11:289-296, hereby 
incorporated by reference, that was digested with the 
restriction endonuclease BamHI. Cosmid DNA was packaged 
arid transfected into E. coli strain DH5-a using extracts 
purchased from Stratagene, La Jolla, ca. colonies were 

25 screened by blot hybridization analysis. 

The analysis of proteins produced from cloned DNA 
by in vitro transcription/translation assays was analyzed 
as follows. These assays were performed with cell free 
extracts, (Amersham, Arlington Heights, Illinois), and 

30 were performed using conditions as described by the 

manufacturer. The resultant radiolabeled proteins were 
analyzed by SDS-pagE. 

RNA was purified from early log and stationary 
.phase Salmonella cultures by the hot phenol method, Case 

35 et al., 1988, Gene 72:219-236, hereby incorporated by 
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referenc , and run in agarose-f ormaldehyde gels for blot 
hybridization analysis, Thomas, 1980, Proc. Natl. Acad, 
sci. USA 72:5201, hereby incorporated by reference. 
Primer extension analysis of RNA was performed as 
5 previously described, Miller et al., 1986, Nuc. Acids. 
Res. 14:7341-7360, hereby incorporated by reference, 
using AMV reverse transcriptase (Promega, Madison, 
Wisconsin) and synthesized oligonucleotide primers 
complementary to nucleotides 335-350 and 550-565 of the 

10 ~ page locus . , _ — — •.. . — . . . ..... 

T^irinati ^ ^ „n 18 kDa protein musing in a 

. of c j-ynhimurium pagrC mutant strain CS119 

was analyzed by two dimensional protein electrophoreses 

to detect protein species that might be absent as a 

15 result of the TnphoA insertion. Only a single missing 
protein species, of approximately 18 kD and pI-8.0, was 
observed when strains, isogenic except for their 
transposon insertions, were subjected to this analysis. 
This 18 kDa species was also missing in similar analysis 

20 of salmonella strains with mutations phoP and phoQ, 
Though two-dimensional protein gel analysis might not 
detect subtle changes of protein expression in strain 
CS119, this suggested that a single major protein species 
was absent as a result of the pagC::TnphoA insertion. 

25 Additional examination of the 2-dimensional gel 

analysis revealed a new protein species of about 45 kDa 
that is likely the pagC-Ap fusion protein. The pagC-AP 
fusion protein was also analyzed by Western blot analysis 
using antisera to AP and found to be similar in size to 

30 native AP (45 kDa), and not expressed in PhoP-S. 
typhimurium. 

nmnim of panC'iTnr ^i insertion Chromosomal 
DNA was prepared from 5. typhimurium strain CS119 and a 
rough physical map of the restriction endonuclease sites 
35 in the region of the pagC::TnphoA fusion was determined 
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by using a DNA fragment of TnphoA as a probe in blot 
hybridization analysis. This work indicated that 
digestion with the restriction endonuclease ecoRV yielded 
a single DNA fragment that included the pagC: : TnphoA 
5 insertion in addition to several kilobases of flanking 
DNA. Chromosomal DNA from strain CS119 was digested with 
EcoRV (blunt end) and ligated into the bacterial plasmid 
vector pUC19 (New England Biolabs) that had been digested 
with the restriction endonuclease Smal (blunt end) . This 

10 DNA was electroporated into the E. coli strain DH5r-ot 
(BRL) " and colonies -were- plated onto LB^ agar containing 
the antibiotics kanairiycin (TnphoA encoded and ampicillin 
(pUC19 encoded) . A single ampicillin and kanamycin 
resistant clone containing a plasmid designated pSMlOO 

15 was selected for further study. 

A radiolabeled DNA probe from pSMlOO was 
constructed and used in Southern hybridization analysis 
of strain CS119 and its wild type parent ATCC 10428 to 
prove that the pagC: : TnphoA fusion had been cloned. The 

20 probe contained sequences immediately adjacent to the 
transposon at the opposite end of the alkaline 
phosphatase gene [Hpal endonuclease generated DNA 
fragment that included 186 bases of the right IS50 of the 
transposon and 1278 bases of Salmonella DNA (Fig. 2) . As 

25 expected, the pSMlOO derived probe hybridized to an 11- 
12 kb AccJ endonuclease digested DNA fragment from the 
strain containing the transposon insertion, CS119. This 
was approximately 7.7kb (size of TnphoA) larger than the 
3.9 kB AccI fragment present in the wild type strain that 

30 hybridizes to the probe. In addition, a derivative of 

plasmid pSMlOO, pSMlOl (which did not allow expression of 
the pagC-PhoA gene fusion off the lac promoter) , was 
transformed into phoP- (strain Cs015) and phoN- (strain 
CS019) Salmonella strains and the cloned AP activity was 

35 found to be dependent on phoP for expression. Therefore 
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we concluded that the cloned DNA contained the 
pagC::TnphoA fusion. 

The presence of the page gene was also 
demonstrated in other strains of 3. typhimurium, as well 
5 as in 5. typhi, and S. drypool. All Salmonella strains 
examined demonstrated similar strong hybridization to. an 

8.0 kb ScoRV and a 3.9 kb Accil restriction endonuclease 
fragment suggesting that page is a virulence gene common 

to salmonella species. . _ 

10 The page gene probe_from nucleotides -46 (with 1 

as the first bal•^•MMtd-BO^(^^*lt•■tft-. 
the sglll site) failed to cross hybridize to DNA from 
Citrobacter freundii, Shigella flexneri, Shigella sonnei, 
Shigella dysenterial, Escherichia coli, Vibrio cholerae f 
15 vibrio vulnificus, Yersenia entero colitica, and 

Klibsiella pneumonia. 

m^nn of ^ ™™ fcvpp naorc 1 ocus DNA and its 
^ pi^nfcat ^™ ^ the virnlennR defect of a S. 
f Vp M m .W UM P ™r. mutant The same restriction 

20 endonuclease fragment described above was used to screen 
a cbsmid gene bank of wild type strain ATCC 10428. A 
single clone, designated pWP061, contained 18 kilobases 
of s. typhimurium DNA and hybridized strongly to the pagC 
DNA probe. pWP061 was found to contain Salmonella DNA 

25 identical to that of pSMlOO when analyzed by restriction 
endonuclease analysis and DNA blot hybridization studies. 
Probes derived from pWP061 were also used in blot 
hybridization analysis with DNA from wild, type and CS119 
5. typhimurium. identical hybridization patterns were 

30 observed to those seen with pSMlOO. pWP061 was also 

mobilized into strain CS119, a page mutant strain. The 
resulting strain had wild type virulence for BALB/c mice 
(a LD S0 less than 20 organisms when administered by IP 
injection) . Therefore the cloned DNA complements the 

35 virulence defect of a pagC mutant strain. 
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Since , a wild type cosmid containing page locus 
DNA was found to complement the virulence defect of a 
page mutant S. typhimurium strain, it was concluded that 
the page protein is an 188 amino acid (18 kDa) membrane 
5 (see below) protein essential for survival within 
microphages and virulence of S. typhimurium. 

Physical mapping of restriction endo nuclease 
sites, DNA sequencing, and determinat ion of the pagC gene 
product Restriction endonuclease analysis of plasmid 

10 pSMlOO and pWP061 was performed to obtain a physical map 

of ~ the page Ibcus; and, "in the case of PSMlOO, to 

determine the direction of transcription (Fig. 2) . DNA 
subclones were generated and the TnphoA fusion junctions 
were seguenced, as well as the Salmonella DNA extending 

15 from the Hpai site, 828 nucleotides 5' to the phoA fusion 
junction, to the EcoRI site 1032 nucleotides 3 1 to the 
TnphoA insertion (Fig. 2 and 3). The correct reading 
frame of the DNA sequence was deduced from that required 
to synthesize an active AP gene fusion. The deduced 

20 amino acid sequence of this open reading frame was 
predicted to encode a 188 amino acid protein with a 
predicted pI+8 . 2. This data were consistent with the 2- 
D polyacrylamide gel analysis of strain CS119 in which an 
18 kDa protein of approximate pI+8.0 was absent. No 

25 other open reading frames, predicted to encode peptides 
larger than 30 amino acids, were found. 

The deduced amino acid sequence of the 188 amino 
acid open reading frame contains a methionine start codon 
33 amino acids from the fusion of pagC and AP (Fig. 3). 

30 This 33 amino acid page contribution to the fusion 

protein was consistent with the size observed in Western 
blot analysis and contains a hydrophobic N-terminal 
region, identified by the method of Kyle et al. f 1982, J. 
Mol. Biol. 157 :105-132, hereby incorporated by reference, 

35 that is a typical bacterial signal sequence, Von Heinje, 
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1985, J. Mol. Biol. 184:99-105, hereby incorporated by 
reference. Specifically, amino acid 2 is a positively 
charged lysine, followed by a hydrophobic domain and 
amino acid 24 is a negatively charged aspartate residue. 
A consensus cleavage site for this leader peptide is 
predicted to be at an alanine residue at amino acid 23, 
Von Heinje, 1984, J. Mol. Biol. 173:243-251, hereby, 
incorporated by reference. The DNA sequence also 
revealed a typical ribosomal binding site, Shine et al. , 
1974, Proc. Natl. Acad. Sci. USA 71:1342-1346, hereby 
incorporated by reference , at 6-2 nucleotides 5- to_the. 
predicted start of translation (Fig. 3) nucleotides 717- 
723) . This suggested that the open reading frame was, in 
fact, translated and further supported the assumption 
15 that this was the deduced amino acid sequence of the page 
protein interrupted by the TnphoA insertion (Fig. 3). 

Tn vitro synthesis of protei ns by the cloned page 
locus to detect if other proteins were encoded by page 
and to determine the approximate size of the pagrC gene 
20 product, an in vitro coupled transcription/translation 

analysis was performed. A 5.3 kilobase EcoRT fragment of 
PWP061 was inserted into pUC19 so that the page gene 
would not be expressed off the lac promotor. This 
plasmid was used in an in vitro coupled transcription- 
25 translation assay. A single protein of approximately 22 
kilodaltons was synthesized by the cell free system. The 
size was compatible with this being the precursor, of the 
. pagC protein containing its leader peptide. These data 
further support the conclusion the single and the single 
30 page gene product had been identified. 

Identification of the paor f- gnnoded RNA .An 
approximately 1100 nucleotide RNA is encoded by page. , 
The page gene is highly expressed by cells with a phoP 
constitutive phenotype of pag activation, as compared to 
35 wild type and phoP constitutive phenotype of pag 
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activation, as compared to wild type and phoP- bacteria, 
in these blot hybridization experiments page is only 
detected in wild type cells grown in rich media during 
stationary growth. This result, coupled with previous 
5 work, Miller et al. f 1989, supra, Miller et al., 1990, 
supra, demonstrates that page is transcriptionally 
regulated by the phoP gene products and is only expressed 
during early logarithmic phase growth in rich media by 
cells with a phoP constitutive phenotype. 

!0 The size of the page transcript is approximately 

"500 nucleotides greater than -that necessary _*o_ encode t he _ 
188 amino acid protein. Primer extension analysis of 
Salmonella RNA using oligonucleotide primers specific for 
page sequence was performed to determine the approximate 

15 start site of transcription and to determine whether 
these nucleotides might be transcribed 5 1 or 3' to the 
188 amino acid page gene product. Primer extension 
analysis with an oligonucleotide predicted to be 
complementary to nucleotides 550-565 of pagC, 150 

20 nucleotides 5' to the predicted start codon, resulted in 
an approximately 300 nucleotide primer extension product. 
Therefore a primer further upstream was constructed 
complementary to nucleotides 335-350 of pagC and used in 
a similar analysis. A primer extension product of 180 

25 nucleotides was observed to be primer specific. This is 
consistent with transcription starting at nucleotide 170 
(Fig. 3) . Upstream of the predicted transcriptional 
start, at nucleotides 153-160, a classic RNA polymerase 
binding site was observed with the sequence TATAAT at - 

30 12 nucleotides as well as the sequence TAATAT at -10 

nucleotides. No complete matches were observed for the 
consensus RNA polymerase recognition site (TTGACA) 15-21 
nucleotides upstream from the -10 region. AT -39 (126- 
131) nucleotides (TTGGAA) , -38 (127-132) nucleotides 

35 (TTGTGG) , and -25 (135-140) nucleotides (TTGATT) are 
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sequences that have matches with the most frequently 
conserved nucleotides of this sequence. 

Based on the above results transcription was 
predicted to terminate near the translational stop codon 
5 of the 188 amino acid protein (nucleotide 1295, Fig. 3). 
Indeed, a stem loop configuration was found at 
nucleotides 1309-1330 that may function as a 
transcription terminator. This was consistent with the 
lack of evidence of open reading frames downstream of the 

10 188 amino, acid protein and the lack of synthesis of other 

• ~" using the" "cloned page DNA-. 

This further suggests that the pagC: :TnphoA insertion 
inactivated the synthesis of only a single protein. 

■cHm-ilaritY P^aC &i1 and Lom A computer 

15 analysis of protein similarity using the National 

Biomedical Research Foundation/Protein Identification 
Resource/George et al., 1986, Nucleic Acids Res. 14: Il- 
ls, hereby incorporated by reference, protein sequence 
base was conducted to identify other proteins that had 

20 similarity to page in an attempt to find clues to the 

molecular function of this protein. Remarkably, page was 
found to be similar to a bacteriophage lambda protein, 
Lorn, that has been localized to the outer membrane in 
minicell analysis, Court et al., 1983, Lambda II, 

25 Hendrix, R.W. et al. ed. Cold Spring Harbor Laboratory 

(cold spring Harbor NY), pp. 251-277, hereby incorporated 
by reference, and demonstrated to be expressed by lambda 
lysogens of E. coli, Barondess, et al. , 1990, Nature 
34§: 871-874, hereby incorporated by reference. Recently, 

30 the deduced amino acid sequence of the cloned ail gene 
product of Y. enterocolitis was determined and found to 
also be similar to Lom, Miller et al., 1990b, J. 
Bacterid. 172:1062-1069. Therefore, a protein family 
sequence alignment was performed using a computer 

35 algorithm that establishes protein sequence families and 
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consensus sequences, Smith et al., 1990 , Proc. Natl. 
Acad, Sci. 87: 118-122 , hereby incorporated by reference. 
The formation of this family is indicated by the internal 
data base values of similarity between these proteins : 
5 page and Lorn (107.8), page and Ail (104.7), and Ail and 
Lorn (89.8). These same proteins were searched against 
314 control sequences in the data base and mean values 
and ranges were 39.3 (7.3-52.9) page, 37.4 (7.3-52.9) 
Ail, and 42.1 (7.0-61.9) Lorn. The similarity values for 
10 this protein family are all greater than 3.5 standard 

- — deviations above -the highest score, .obtained for 

similarity to the 314 random sequences. No other 
similarities or other family members were found in the . 
database. Regions of similarity are located not only in 
15 the leader peptide transmembrane domains but throughout 
the protein. 

paaC Mutant Strains Are Attenuated For Vi rulence 
Salmonella typhimurium strains with a pagC 
mutation are most likely inactivated for the phoP- 
20 / regulated gene product, as these strains are attenuated 
for virulence by at least 1,000-fold. 

Attenuation of Bacterial Virulence bv Constitutive 
Expression of Two-component Regulatory Systems . 

The virulence of a bacterium can be attenuated by 
25 inducing a mutation or which results in the constitutive 
expression of genes under the control of a two-component 
regulatory system or by inducing a mutation that 
inactivates a gene under the control of the two-component 
systems. A balance between the expression of the genes 
30 under the control of the two-component system, e.g., 

between pag and prg gene expression, and possibly beteen 
two-component system regulated genes and other genes, is 
necessary for full virulence. Mutations that disrupt 
this balance, e.g., mutations that cause the constitutive 
35 expression of a gene under the control of the two- 
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component system, or a mutation that inactivates a gene 
under the control of the two-component system, e.g., the 
pag gene, reduce virulence. 

Constitutive mutations in two-component 
5 regulators can be identified by the use of a strain 

containing a recorder gene fusion to a gene regulated by 
the two-component system. Such gene fusions would most 
typically include DNA encoding the lacZ gene or alkaline 
phosphatase fused to a gene under the control of the two- 

10 component system. Strains containingjusions that are 
"(as compared to wild "type" or -parental strains) highly t 
expressed in an unregulated fashion, i.e., constitutive, 
can be detected by increased color on chromogenic 
substrates for the enzymes. To detect constitutive 

15 . mutations a cloned virulence regulator could be 

mutagenized e.g., by passage through an E- coll strain 
defective in DNA repair or by chemical mutagenesis. The 
mutated DNA for the regulator would then be transferred 
to the strain containing the gene fusion. and constitutive 

20 mutations identified by the high gene fusion expression 
(blue color in the case of a lacZ fusion grown on media 
containing X-gal) . Constitutive mutations in a component 
of a two-component regulatory system could also be made 
by in vitro mutagenesis after other constitutive 

25 mutations have been sequenced and a specif ic amino acid 
change responsible for const itutivity identified. 
Putting several amino acid changes that all result in a 
PhoP constitutive phenotype would result in a decreased 
frequency of reversion by spontaneous base changes. A 

30 constitutive mutation could also be constructed by 
deletion of the portion of the amino terminus of the 
phospho-accepting regulator which contains the 
phosphoacceptor domain e.g. , deletion of sequences 
encoding amino acids amino terminal to amino acid 119 in 

35 the phoP gene or deletion of analogous phospho accepting 
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sequences in genes of other two-component regulatory 
systems. This could result in a conformational change 
similar to that induced by phosphorylation and result in 
increased DNA binding and transcriptional activation. 
5 Use 

The Salmonella cells of the invention are useful 
as sources of immunological protection against diseases, 
e.g., typhoid fever and related diseases, in an animal, 
e.g., a mammal, e.g., a human, in particular as the basis 

10 of a live-cell vaccine capable of colonizing the 

inoculated- aniraalJ s . intestine, and -provoking _a_ strpng__._... 

immune reaction. Appropriate dosages and conditions of 
administration of such a live, attenuated vaccine are as 
described in Holem et »i - f ,Amte Enteric Infections in 

15 children. New Prospects for Treat ment and Prevention 

(1981) Elsevier /North-Holland biomedical Press, Ch. 26, 
pp. 443 et seq. (Levine et al.) ; hereby incorporated by 
reference. 

Other Embodiments 
20 Other embodiments, e.g., strains which in addition 

to a phoP related mutation or genetic alteration also 
contain an attenuating mutation in another gene, e.g., an 
aromatic amino acid synthetic gene, e.g., aroA or aroD, 
or in cya gene (adenylate cyclase) or crp gene (adenylate 
25 cyclase receptor) are also within the claims, 
What is claimed is: 
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COMPUTER SUBMISSION OF DNA AND AMINO ACID SEQUENCES 
(1) GENERAL INFORMATION: 



Miller, Samuel I. 
Mekalanos, John J. 

Improved Vaccines 
2 



(i) APPLICANT: 

(ii) TITLE OF INVENTION: 

(iii) NUMBER OF SEQUENCES: 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: 

(B) STREET: 

(C) CITY: 
- (D)- STATE: .... _ 

(E) COUNTRY: 

(F) ZIP CODE: 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 

(B) COMPUTER: 
CO OPERATING SYSTEM: 

(D) SOFTWARE: 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 
<C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Clark, Paul T. 

(B) REGISTRATION NUMBER: 30,162 

(C) REFERENCE /DOCKET NUMBER: 00786/065001 



Fish & Richardson 
225 Franklin Street 
Boston 

Massachusetts 
uVs.AT r ~ " 
02110-2804 



3.5" Diskette, 1.44 Mb storage 
IBM PS/2 Model 50Z or 55SX 
IBM P,C. DOS (Version 3.30) 
WordPerfect (Version 5.0) 



(ix) TELECOMMUNICATION INFORMATION: 



(A) TELEPHONE: 

(B) TELEFAX: 

(C) TELEX: 



(617) 542-5070 
(617) 542-8906 
200154 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 1: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2320 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : a ingle 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQUENCE ID NO: 1: 

GTTAACCACT CTTAATAATA ATGGGTTTTA TAGCGAAATA CACTTTTTTA TCGCGTGTTC 60 

AATATTTGCG TTAGTTATTA TTTTTTTGGA ATGTAAATTC TCTCTAAACA CAGGTGATAT 120 

TTATGTTGGA ATTGTGGTGT TGATTCTATT CTTATAATAT AACAAGAAAT GTTGTAACTG 180. 
^MAG ATMAlT TAAMlGATT A AATCGG AGGG "GG AATAAAG C " GTGCTAAGCA TCATCGTGAA^ - 2 4 0 

TATGATTACA GCGCCTGCGA TGGCATATAA CCGTATTGCG GATGGAGCGT CACGTGAGGA 300 

CTGTGAAGCA CAATGCGATA TGTTCTGATT ATATGGCGAG TTTGCTTAAT GACATGTTTT 360 

TAGCCGAACG GTGTCAAGTT TCTTAATGTG GTTGTGAGAT TTTCTCTTTA AATATCAAAA 420 

TGTTGCATGG GTGATTTGTT GTTCTATAGT GGCTAAAGAC TTTATGGTTT CTGTTAAATA 480 

TATATGCGTG AGAAAAATTA GCATTCAAAT CTATAAAAGT TAGATGACAT TGTAGAACCG 540 

GTTACCTAAA TGAGCGATAG AGTGCTTCGG TAGTAAAAAT ATCTTTCAGG AAGTAAACAC 600 

ATCAGGAGCG ATAGCGGTGA ATTATTCGTG GTTTTGTCGA TTCGGCATAG TGGCGATAAC 660 

TGAATGCCGG ATCGGTACTG CAGGTGTTTA AACACACCGT AAATAATAAG TAGTATTAAG 720 

GAGTTGTT 728 

ATG AAA AAT ATT ATT TTA TCC ACT TTA GTT ATT ACT ACA AGC GTT TTG 776 
Met Lye Asn He He Leu Ser Thr Leu Val He Thr Thr Ser Val Leu 
5 10 15 

GTT GTA AAT GTT GCA CAG GCC GAT ACT AAC GCC TTT TCC GTG GGG TAT 824 
Val Val Asn Val Ala Gin Ala Asp Thr Asn Ala Phe Ser Val Gly Tyr 
20 25 30 

GCA CGG TAT GCA CAA AGT AAA GTT CAG GAT TTC AAA AAT ATC CGA GGG 872 
Ala Arg Tyr Ala Gin Ser Lys Val Gin Asp Phe Lys Asn He Arg Gly 
35 40 45 

GTA AAT GTG AAA TAG CGT TAT GAG GAT GAC TCT CCG GTA AGT TTT ATT 920 
Val Asn Val Lys Tyr Arg Tyr Glu Asp Asp Ser Pro Val Ser Phe He 
50 55 60 

TCC TCG CTA AGT TAC TTA TAT GGA GAC AGA CAG GCT TCC GGG TCT GTT 968 
Ser Ser Leu Ser Tyr Leu Tyr Gly Asp Arg Gin Ala Ser Gly Ser Val 
65 70 75 80 
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GAG COT GAA GGT ATT CAT TAC CAT GAC AAG TTT GAG GTG AAG TAC GGT 
Glu Pro Glu Gly He His Tyr His Asp Lys Phe Glu Val Lye Try oxy 
85 90 

TCT TTA ATG GTT GGG CCA GCC TAT CGA TTG TCT GAC AAT TTT TCG TTA 
Ser leu Met Val Gly Pro Ala Tyr Arg Leu Ser Asp Asn Phe 5er Leu 

„„„ _„ MT GTC ggc ACG GTA AAG GCG ACA TTT AAA GAA CAT 

Sr S Su Ala S S X Thr Val Lys Ala Thr Phe Lys Glu His 
115 120 125 

TCC ACT CAG GAT GGC GAT TCT TTT TCT AAC AAA ATT TCC TCA AGG AAA 
ser T*hr Tin Asp Gly Asp Ser Phe Ser Asn Lys lie Ser Ser Arg Lys 



130 



135 



ACG GGA TTT GCC TGG GGC GCG GGT GTA CAG ATG AAT CCG CTG GAG AAT 
Thr pS Ala Trp Gly Ala Gly Val Gin Met Asn Pro Leu Glu A n 
145 150 155 

ATC GTC GTC GAT GTT GGG TAT GAA GGA AGC AAC ATC TCC TCT ACA AAA 
Tlt Zt Val Asp val Gly Tyr Glu Gly Ser Asn lie Ser Ser Thr Lys 
165 I 70 175 

ATA AAC GGC TTC AAC GTC GGG GTT GGA TAC CGT TTC TGA AAAGC 
lie Asn Gly Phe Asn Val Gly Val Gly Tyr Arg Phe 
180 I 85 



1016 



1064 



1112 



1160 



1208 



1256 



1300 



AT AAG CT ATG 


CGGAAGGTTC 


GCCTTCCGCA 


CCGCCAGTCA ATAAAACAGG 


GCTTCTTTAC 


1360 


CAGTGACACG 


TACCTGCCTG 


TCTTTTCTCT 


CTTCGTCATA CTCTCTTCGT 


CATAGTGACG 


1420 


CTGTACATAA 


CATCTCACTA 


GCATAAGCAC 


AGATAAAGGA TTGTGGTAAG 


CAATCAAGGT 


1480 


TGCTCAGGTA 


GGTGATAAGC 


AGGAAGGAAA 


ATCTGGTGTA AATAACGCCA 


GATCTCACAA 


1540 


GATTCACTCT 


GAAAAATTTT 


CCTGGAATTA 


ATCACAATGT CATCAAG ATT 


TTGTGACCGC 


1600 


CTTCG CAT AT 


TGTACCTGCC 


GCTGAACGAC 


TACTGAAAAG TAGCAAGGTA 


TGTATTTTAT 


1660 


CCAGGAGAGC 


ACCTTTTTTG 


CGCCTGGCAG 


AAGTCCCCAG CCGCCACTAG 


CTCAGCTGGA 


1720 


TAGAGCATCA 


ACCTCCTAAG 


TTGATGGTGC 


GAGGTTCGAG GCCTCGGTGG 


CGGTCCAATG 


1780 


TGGTTATCGT 


ATAATGTTAT 


TACCTCAGTG 


TCAGGCTGAT GATGTGGGTT 


CGACTCCCAC 


1840 


TGACCACTTC 


AGTTTTGAAT 


AAGTATTGTC 


TCGCAACCCT GTTACAGAAT 


AATTTCATTT 


1900 


ATTACGTGAC 


AAGATAGTCA 


TTTATAAAAA 


ATGCACAAAA ATGTTATTGT 


CTTTTATTAC 


1960 


TTGTGAGTTG 


TAGATTTTTC 


TTATGCGGTG 


AATCCCCCTT TGCGGCGGGG 


CGTCCAGTCA 


2020 


AATAGTTAAT 


GTTCCTCGCG 


AACCATATTG 


ACTGTGGTAT GGTTCACCGG 


GAGGCACCCG 


2080 
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GCACCGCAAT TTTTTATAAA ATGAAATTCA CACCCTATGG TTCAGAGCGG TGTCTTTTTA 2140 

CATCAGGTGG GCAAGCATAA TGCAGGTTAA CTTGAAAGAT ACGATCAATA GCAGAAACCA 2200 

GTGATTTCGT TTATGGCCTG GGGATTTAAC CGCGCCAGAG CGTATGCAAG ACCCTGGCGC 2260 

GGTTGGCCGG TGATCGTTCA ATAGTGCGAA TATGAATGGT TACCAGCCGC CTGCGAATTC 2320 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 
. —( B )— TYPE : . _ 

(C) STRANDED NESS : 

( D ) TOPOLOGY : 

(ii) SEQUENCE DESCRIPTION: SEQUENCE ID NO: 2: 



53 

nucleic acid 

Fingils 

linear 



CATTTCTCAT TGATAATGAG AATCATTATT GACATAATTG TTATTATTTT ACG 



53 
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Claims 

x 1. A vaccine comprising a Salmonella cell the 

2 virulence of which is attenuated by a first mutation in 

3 the phoP regulatory region causing constitutive 

4 expression of a gene under the control of said region and 

5 by a second mutation at an aro, pag, or prg gene. 



1 

2 



2. A vaccine comprising a Salmonella cell the 
virulence of which is attenuated by a mutation in a pag 
3 or a prg gene and by a mutation in an aro gene. 

x 3 . a Salmonella^ cell which : cdhstitutively 

2 expresses a phoP regulatory region regulated gene and 

3 which comprises a virulence attenuating mutation in an 

4 aro, a prg, or a pag gene. 



1 
2 



4. A Salmonella cell which comprises a first 
virulence attenuating mutation in a pag or a prg gene and 

3 a second virulence attenuating mutation in an aro gene. 

! 5. A live Salmonella cell in which there is 

2 inserted into a pag or a prg gene a gene encoding a 

3 heterologous protein, or a regulatory element, of said 

4 heterologous protein gene. 

! 6. The live Salmonella cell of claim 5, wherein 

2 said DNA encoding a heterologous protein is under the 

3 control of an environmentally regulated promoter. 

x 7 . A vector capable of integrating into the 

2 chromosome of Salmonella comprising 

3 a first DNA sequence encoding a heterologous 

4 protein, 

, 5 a second DNA sequence encoding a marker, and 

6 a third DNA sequence encoding a product necessary 

7 for virulence ■, said third DNA sequence being mutationally 

8 inactivated. 
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1 8". A vector comprising DNA which encodes the page 

2 gene product. 

1 9. A purified preparation of the pagC gene 

2 product. 

1 10. A method of detecting the! presence of 

2 Salmonella in a sample comprising contacting said sample 

3 with pagC encoding DNA and detecting the hybridization of 

4~ " said "pagC encoding DNA to nucleic acid -in. said_ sample ... _ 
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10 20 30 AO 5tf 60 70 

GTTAACCACT CTTAATMTA ATGGGTTTTA' TACCGAAATA TACTTTTTTA TCGCGTCTTC AATATTTCCC 

80 90 100 110 120 130 140 



TTACTTATTA TTTTTTTGGA ATGTAAATTC TCTCTAAACA CAGCTCATAT TTATCTTGGA ATTCTCGTGT 



150 . 160 170 180 190 200 210 

TCATTCTATT CTTATAATAT] AACAACAAAT . GTTCTAACTG ATAGATATAT TAAAACATTA AATCCGAGCG 

220 230 240 250 260 270 280 

GGAATAAAGC CICCTAAGCA TCATCCTGAA TATCATTACA GCGCCTGCGA TGGCATATAA CCGTATTCCG 

290 300 310 320 330 340 350 

CATCCAGCGT CACGtGAGCA CTGTGMGCA CAATCCCATA TGTTCTGATT ATA TGGCCAG TTTCCTTAAT 

360 370 380 390 400 410 420 

CACATGTTTT TAGCCGAACG GTGTCAAGTT TCTTAATGTG CTTGTGAGAT TTTCTCTTTA AATATCAAAA 

430 440 450 460 470 480 490 

TGTTGCATGG GTGATTTGTT CTTCTATAGT CCCTAAACAC TTTATCCTTT CTCTTAAATA TATATGCGTG 

500 510 520 530 540 550 560 

ACAAAAATTA CCATTCAAAT CTATAAAAGT TAGATCACAT TGTAGAACCC GTTACCTA AA TGAGCCATAC 

570 580 590* 600 610 620 630 

ACTCCT TCGG TAGTAAAAAT ATCTTTCAGC AAGTAAACAC ATCAGCAGCG ATACCGGTGA ATTATTCGTG 

6U0 650 660 670 680 690 700 

GTTTTGTCGA TTCCCCATAG TCGCGATAAC TGAATGCCGG ATCGGTACTG CAGGTCTTTA AACACACCGT 

710 720 728 

AAATAATAAG TAGTA TTAAC CACT TGTT 

ATG AAA AAT ATT ATT TTA TCC ACT TTA GTT ATT ACT ACA ACC GTT TTC GTT GTA 782 
MET LYS ASN ILE ILE LEU SER THR LEU VAL ILE THR THR SER VAL LEU VAL VAL 18 

AAT GTT CCA CAG GCC CAT ACT AAC GCC TTT TCC GTC GGG TAT GCA C^G TAT CCA 836 
ASK VAL ALA CLK ALA ASP THR ASK ALA PHE SER VAL GLY TYR ALA ARC TYR ALA 36 

CAA ACT AAA GTT CAG GAT TTC AAA AAT ATG CCA GGG GTA AAT GTC AAA TAC CGT 890 
GLN SER LYS VAL CLN ASP PHE LYS ASM ILE ARC GLY VAL ASH VAL LYS TYR ARC 54 

TAT GAO CAT OAC TCT CCG GTA ACT TTT ATT TCC TCC CTA AGT TAC TTA TAT GGA 944 
TYR GLU ASP ASP SER PRO VAL SER PHE ILE SER SER LEU SER TYR LEU TYR GLY 72 

GAC ACA CAG GCT TCC GGG TCT GTT GAG CCT CAA GGT ATT CAT TAC CAT GAC AAG 998 
ASP ARC CLN ALA SER GLY SER VAL GUI PRO GLU GLY ILE HIS. TYR HIS ASP LYS 90 

TTT GAG GTG AAC TAC GGT TCT TTA ATG GTT GGG CCA CCC TAT CCA TTC TCT GAC 1052 
PHE CLU VAL LYS TYR GLY SER LEU WET VAL CLY PRO ALA TYR ARC LEU SER ASP 108 

AAT TTT TCG TTA TAC CCG CTC CCG OCT GTC GCC ACG GTA AAG CCG ACA TTT AAA 1106 
ASH PHE SER LEU TYR ALA LEU ALA GLY VAL CLY THR VAL LYS ALA THR PHE LYS 126 

CAA CAT TCC ACT CAG GAT CCC CAT TCT TTT TCT AAC AAA ATT TCC TCA AGO AAA 1160 
GLU H2S SER THR CLK ASP CLY ASP SER PHE SER ASH LYS ILE SER SER ARO LYS 144 



ACG CCA TTT CCC TCG GGC CCG GGT GTA CAG ATG AAT CCC CTG GAC AAT ATC CTC 1214 
THR GLY PHE. ALA TRP GLY ALA CLY VAL CLN MET ASM PRO LEU CLU ASH ILE VAL 162 
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CTC CAT CTT CCC"" CAA CCA ACC AAC ATC TCC TCT Af \AA ATA AAC GCC TTC 1268 
VAL ASP VAL GLY CLU GLY SER ASN ILE SER SER THi. LYS ILE ASN GLY PHE 180 

AAC CTC GCG ClT CCA TAC CCT TTC TCA AAAGC 1300 
ASN VAL GLY VAL CLY TYR ARC PHE 188 

1310 1320 1330 1340 1350 1360 1370 

ATAAGCTATG CGGAACCTTC GCCTTCCGCA CCGCCAGTCA ATAAAACAGG GCTTCTTTAC CAGTCACACG 

1380 1390 1A00 1410 1420 1430 1440 

TACCTCCCTG TCTTTTCTCT CTTCGTCATA CTCTCTTCGT CATAGTGACG CTGTACATAA CATCTCACTA 

1450 1460 1470 1480 1490 1500 1510 

GCATAAGCAC ACATAAAGCA TTGTGGTAAC CAATCAAGGT TGCTCAGCTA GGTCATAAGC AGGAAGGAAA 

1520 1530 1540 1550 1560 1570 1580 

ATCTGGTGTA AATAACGCCA GATCTCACAA CATTCACTCT GAAAAATTTT CCTCCAATTA ATCACAATGT 

1590 1600 1610 1620 1630 1640 1650 

.1 _. CATCAAGATT .TTCTCACCGC_CCT CTjSAACGAC TACTGAAAAG TACCAAGGTA 

1660 1670 1680 1690 1700 1710 1720 

TGTATTTTAT CCAGGAGAGC ACCTTTTTTG CGCCTCGCAG AAGTCCCCAG CCGCCACTAC CTCACCTCCA 

1730 1740 1750 1760 1770 1780 1790 

TAGAGCATCA ACCTCCTAA CTTGATGGTGC CAGGTTCGAC CCCTCCCTCC CCCTCCAATG TGGTTATCCT 

1800 1810 1820 1830 1840 1850 1860 

ATAATCTTAT TACCTCAGT GTCAGGCTGAT OATCTCGGTT CGACTCCCAC TGACCACTTC ACTTTTGAAT 

1870 1880 1890 1900 1910 1920 1930 

AAGTATTCTC TCCCAACCC TGTTACAGAAT AATTTCATTT ATTACGTGAC AAGATAGTCA TTTATAAAAA 

1940 1950 1960 1970 1980 1990 2000 

ATGCACAAAA ATGTTATTG TCTTTTATTAC TTCTGACTTG TAGATTTTTC TTATGCGGTG AATCCCCCTT 

2010 2020 2030 2040 2050 2060 2070 

- TGCGGCGGGC CCTCCAGTC AAATACTTAAT CTTCCTCGCG AACCATATTG ACTCTCCTAT CGTTCACCGG 

2080 ' 2090 2100 2110 2120 2130 2140 

CACGCACCCG GCACCCCAA TTTTTTATAAA ATCAAATTCA CACCCTATGG TTCAGAGCGG TCTCTTTTTA 

2150 2160 2170 2180 2190 2200 2210 

CATCACCTCG GCAACCATA ATGGAGGTTAA CTTGAAAGAT ACCATCAATA GCAGAAACCA GTGATTTCCT 

2220 2230 2240 22SO 2260 2270 2280 

TTATCCCCTG GCGATTTAA CCGCGCCAGAG CGTATGCAAG ACCCTCCCCC CCTTGGCCCG TGATCCTTCA 



2290 2300 2310 

ATAGTCCGAA TATGAATCG TTACCACCCCC TGCGAATTC Q S«f 04*** lA *Jt. 
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